# Sparse Attention

## Lsg Legal Small Uncased 4096

A compact version of LEGAL-BERT that uses the Local+Sparse+Global (LSG) attention mechanism for efficient long-sequence processing.

- Tags: Large Language Model, Transformers, English
- Organization: ccdv
- Downloads: 1,088 · Likes: 0

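As a usage illustration, the sketch below shows how a checkpoint like the one above could be loaded with the Hugging Face transformers library. The repository id `ccdv/lsg-legal-small-uncased-4096` is inferred from the organization and model name in this listing, and the `trust_remote_code=True` flag assumes the checkpoint ships its LSG attention implementation as custom modeling code on the Hub; treat this as a sketch, not the model card's official instructions.

```python
from transformers import AutoModel, AutoTokenizer

# Repository id inferred from the organization/model name above (assumption).
model_id = "ccdv/lsg-legal-small-uncased-4096"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# LSG attention is assumed to be shipped as custom modeling code on the Hub,
# hence trust_remote_code=True.
model = AutoModel.from_pretrained(model_id, trust_remote_code=True)

# Encode a long legal document; the LSG variant accepts inputs up to 4,096 tokens.
document = "This agreement is entered into by and between ... " * 200
inputs = tokenizer(document, max_length=4096, truncation=True, return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, sequence_length, hidden_size)
```
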
## Bigbird Base Trivia Itc

A model fine-tuned from bigbird-roberta-base, optimized for trivia question-answering tasks, with support for processing long sequences.

- License: Apache-2.0
- Tags: Question Answering System, English
- Organization: google
- Downloads: 1,049 · Likes: 8

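For the question-answering checkpoint above, a minimal extractive-QA sketch with transformers might look like the following. The repository id `google/bigbird-base-trivia-itc` is inferred from the listing, and compatibility with the `BigBirdForQuestionAnswering` head is an assumption here rather than something stated in this catalog.

```python
import torch
from transformers import AutoTokenizer, BigBirdForQuestionAnswering

# Repository id inferred from the organization/model name above (assumption).
model_id = "google/bigbird-base-trivia-itc"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = BigBirdForQuestionAnswering.from_pretrained(model_id)

question = "How long can the input sequences be?"
context = (
    "BigBird uses a sparse attention pattern that combines local, random and "
    "global attention, which lets it process sequences of up to 4,096 tokens."
)

inputs = tokenizer(question, context, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Decode the highest-scoring answer span.
start = int(outputs.start_logits.argmax())
end = int(outputs.end_logits.argmax())
print(tokenizer.decode(inputs["input_ids"][0][start : end + 1]))
```
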
## Bigbird Pegasus Large Bigpatent

BigBird is a Transformer model based on sparse attention that can process sequences up to 4,096 tokens long, making it suitable for tasks such as long-document summarization.

- License: Apache-2.0
- Tags: Text Generation, Transformers, English
- Organization: google
- Downloads: 945 · Likes: 40

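The BigBird-Pegasus summarization checkpoints in this list (this one and the arXiv and PubMed variants further down) would typically be used through transformers' seq2seq generation API. A minimal sketch, assuming the repository id `google/bigbird-pegasus-large-bigpatent` inferred from the listing:

```python
from transformers import AutoTokenizer, BigBirdPegasusForConditionalGeneration

# Repository id inferred from the organization/model name above (assumption);
# the arXiv and PubMed checkpoints listed below load the same way.
model_id = "google/bigbird-pegasus-large-bigpatent"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = BigBirdPegasusForConditionalGeneration.from_pretrained(model_id)

# A long patent description; sparse attention allows inputs up to 4,096 tokens.
document = "The present invention relates to ... "  # replace with a real document
inputs = tokenizer(document, max_length=4096, truncation=True, return_tensors="pt")

# Generate an abstractive summary of the long input.
summary_ids = model.generate(**inputs, num_beams=4, max_new_tokens=256)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```
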
## Bigbird Roberta Base Finetuned App

A Transformer model based on sparse attention, fine-tuned specifically for classifying mobile app descriptions.

- License: MIT
- Tags: Text Classification, Transformers, English
- Organization: nsi319
- Downloads: 15 · Likes: 0

## Bigbird Pegasus Large Arxiv

BigBird is a Transformer model based on sparse attention that can handle longer sequences, making it suitable for tasks such as long-document summarization.

- License: Apache-2.0
- Tags: Text Generation, Transformers, English
- Organization: google
- Downloads: 8,528 · Likes: 61

## Bigbird Pegasus Large Pubmed

BigBird-Pegasus is a Transformer model based on sparse attention that handles longer sequences and is especially well suited to long-document summarization.

- License: Apache-2.0
- Tags: Text Generation, Transformers, English
- Organization: google
- Downloads: 2,031 · Likes: 47

## Bigbird Roberta Large

BigBird is a Transformer model based on sparse attention that can process sequences up to 4,096 tokens long, making it suitable for long-document tasks.

- License: Apache-2.0
- Tags: Large Language Model, English
- Organization: google
- Downloads: 1,152 · Likes: 27